Group Members : Tharun Prabhakar, Niranjan Kumar, Sumit Kumar¶
| DRG Definition | Provider Id | Provider Name | Provider Street Address | Provider City | Provider State | Provider Zip Code | Hospital Referral Region Description | Total Discharges | Average Covered Charges | Average Total Payments | Average Medicare Payments | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10001 | SOUTHEAST ALABAMA MEDICAL CENTER | 1108 ROSS CLARK CIRCLE | DOTHAN | AL | 36301 | AL - Dothan | 91 | $32963.07 | $5777.24 | $4763.73 |
| 1 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10005 | MARSHALL MEDICAL CENTER SOUTH | 2505 U S HIGHWAY 431 NORTH | BOAZ | AL | 35957 | AL - Birmingham | 14 | $15131.85 | $5787.57 | $4976.71 |
| 2 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10006 | ELIZA COFFEE MEMORIAL HOSPITAL | 205 MARENGO STREET | FLORENCE | AL | 35631 | AL - Birmingham | 24 | $37560.37 | $5434.95 | $4453.79 |
| 3 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10011 | ST VINCENT'S EAST | 50 MEDICAL PARK EAST DRIVE | BIRMINGHAM | AL | 35235 | AL - Birmingham | 25 | $13998.28 | $5417.56 | $4129.16 |
| 4 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10016 | SHELBY BAPTIST MEDICAL CENTER | 1000 FIRST STREET NORTH | ALABASTER | AL | 35007 | AL - Birmingham | 18 | $31633.27 | $5658.33 | $4851.44 |
(163065, 12)
Index(['DRG Definition', 'Provider Id', 'Provider Name',
'Provider Street Address', 'Provider City', 'Provider State',
'Provider Zip Code', 'Hospital Referral Region Description',
' Total Discharges ', ' Average Covered Charges ',
' Average Total Payments ', 'Average Medicare Payments'],
dtype='object')DRG Definition object Provider Id int64 Provider Name object Provider Street Address object Provider City object Provider State object Provider Zip Code int64 Hospital Referral Region Description object Total Discharges int64 Average Covered Charges object Average Total Payments object Average Medicare Payments object dtype: object
Index(['DRG Definition', 'Provider Id', 'Provider Name',
'Provider Street Address', 'Provider City', 'Provider State',
'Provider Zip Code', 'Hospital Referral Region Description',
'Total Discharges', 'Average Covered Charges', 'Average Total Payments',
'Average Medicare Payments'],
dtype='object')| DRG Definition | Provider Id | Provider Name | Provider Street Address | Provider City | Provider State | Provider Zip Code | Hospital Referral Region Description | Total Discharges | Average Covered Charges | Average Total Payments | Average Medicare Payments | Total Payment | Medicare Payment amount | Total Covered Charge Amount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10001 | SOUTHEAST ALABAMA MEDICAL CENTER | 1108 ROSS CLARK CIRCLE | DOTHAN | AL | 36301 | AL - Dothan | 91 | 32963.07 | 5777.24 | 4763.73 | 525728.84 | 433499.43 | 2999639.37 |
| 1 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10005 | MARSHALL MEDICAL CENTER SOUTH | 2505 U S HIGHWAY 431 NORTH | BOAZ | AL | 35957 | AL - Birmingham | 14 | 15131.85 | 5787.57 | 4976.71 | 81025.98 | 69673.94 | 211845.90 |
| 2 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10006 | ELIZA COFFEE MEMORIAL HOSPITAL | 205 MARENGO STREET | FLORENCE | AL | 35631 | AL - Birmingham | 24 | 37560.37 | 5434.95 | 4453.79 | 130438.80 | 106890.96 | 901448.88 |
| 3 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10011 | ST VINCENT'S EAST | 50 MEDICAL PARK EAST DRIVE | BIRMINGHAM | AL | 35235 | AL - Birmingham | 25 | 13998.28 | 5417.56 | 4129.16 | 135439.00 | 103229.00 | 349957.00 |
| 4 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10016 | SHELBY BAPTIST MEDICAL CENTER | 1000 FIRST STREET NORTH | ALABASTER | AL | 35007 | AL - Birmingham | 18 | 31633.27 | 5658.33 | 4851.44 | 101849.94 | 87325.92 | 569398.86 |
DRG Definition object Provider Id int64 Provider Name object Provider Street Address object Provider City object Provider State object Provider Zip Code int64 Hospital Referral Region Description object Total Discharges int64 Average Covered Charges float64 Average Total Payments float64 Average Medicare Payments float64 Total Payment float64 Medicare Payment amount float64 Total Covered Charge Amount float64 dtype: object
| Provider Id | Provider Zip Code | Total Discharges | Average Covered Charges | Average Total Payments | Average Medicare Payments | Total Payment | Medicare Payment amount | Total Covered Charge Amount | |
|---|---|---|---|---|---|---|---|---|---|
| count | 163065.000000 | 163065.000000 | 163065.000000 | 163065.000000 | 163065.000000 | 163065.000000 | 1.630650e+05 | 1.630650e+05 | 1.630650e+05 |
| mean | 255569.865428 | 47938.121908 | 42.776304 | 36133.954224 | 9707.473804 | 8494.490964 | 4.089749e+05 | 3.554069e+05 | 1.508465e+06 |
| std | 151563.671767 | 27854.323080 | 51.104042 | 35065.365931 | 7664.642598 | 7309.467261 | 6.592961e+05 | 5.766743e+05 | 2.531100e+06 |
| min | 10001.000000 | 1040.000000 | 11.000000 | 2459.400000 | 2673.000000 | 1148.900000 | 3.031193e+04 | 1.263790e+04 | 2.971892e+04 |
| 25% | 110092.000000 | 27261.000000 | 17.000000 | 15947.160000 | 5234.500000 | 4192.350000 | 1.215319e+05 | 1.001459e+05 | 3.841509e+05 |
| 50% | 250007.000000 | 44309.000000 | 27.000000 | 25245.820000 | 7214.100000 | 6158.460000 | 2.255856e+05 | 1.915830e+05 | 7.828449e+05 |
| 75% | 380075.000000 | 72901.000000 | 49.000000 | 43232.590000 | 11286.400000 | 10056.880000 | 4.513765e+05 | 3.930409e+05 | 1.663046e+06 |
| max | 670077.000000 | 99835.000000 | 3383.000000 | 929118.900000 | 156158.180000 | 154620.810000 | 6.435718e+07 | 5.034036e+07 | 1.796832e+08 |
| DRG Definition | Provider Name | Provider Street Address | Provider City | Provider State | Hospital Referral Region Description | |
|---|---|---|---|---|---|---|
| count | 163065 | 163065 | 163065 | 163065 | 163065 | 163065 |
| unique | 100 | 3201 | 3326 | 1977 | 51 | 306 |
| top | 194 - SIMPLE PNEUMONIA & PLEURISY W CC | GOOD SAMARITAN HOSPITAL | 100 MEDICAL CENTER DRIVE | CHICAGO | CA | CA - Los Angeles |
| freq | 3023 | 633 | 183 | 1505 | 13064 | 3653 |
number of duplicate rows: (0, 15)
| total | percent | |
|---|---|---|
| DRG Definition | 0 | 0.0 |
| Provider Id | 0 | 0.0 |
| Provider Name | 0 | 0.0 |
| Provider Street Address | 0 | 0.0 |
| Provider City | 0 | 0.0 |
| Provider State | 0 | 0.0 |
| Provider Zip Code | 0 | 0.0 |
| Hospital Referral Region Description | 0 | 0.0 |
| Total Discharges | 0 | 0.0 |
| Average Covered Charges | 0 | 0.0 |
| Average Total Payments | 0 | 0.0 |
| Average Medicare Payments | 0 | 0.0 |
| Total Payment | 0 | 0.0 |
| Medicare Payment amount | 0 | 0.0 |
| Total Covered Charge Amount | 0 | 0.0 |
['Provider Id', 'Provider Zip Code', 'Total Discharges', 'Average Covered Charges', 'Average Total Payments', 'Average Medicare Payments', 'Total Payment', 'Medicare Payment amount', 'Total Covered Charge Amount']
['DRG Definition', 'Provider Name', 'Provider Street Address', 'Provider City', 'Provider State', 'Hospital Referral Region Description']
<seaborn.axisgrid.PairGrid at 0x14259a95410>
| DRG Definition | Provider Id | Provider Name | Provider Street Address | Provider City | Provider State | Provider Zip Code | Hospital Referral Region Description | Total Discharges | Average Covered Charges | Average Total Payments | Average Medicare Payments | Total Payment | Medicare Payment amount | Total Covered Charge Amount | Provider Region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10001 | SOUTHEAST ALABAMA MEDICAL CENTER | 1108 ROSS CLARK CIRCLE | DOTHAN | AL | 36301 | AL - Dothan | 91 | 32963.07 | 5777.24 | 4763.73 | 525728.84 | 433499.43 | 2999639.37 | South |
| 1 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10005 | MARSHALL MEDICAL CENTER SOUTH | 2505 U S HIGHWAY 431 NORTH | BOAZ | AL | 35957 | AL - Birmingham | 14 | 15131.85 | 5787.57 | 4976.71 | 81025.98 | 69673.94 | 211845.90 | South |
| 2 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10006 | ELIZA COFFEE MEMORIAL HOSPITAL | 205 MARENGO STREET | FLORENCE | AL | 35631 | AL - Birmingham | 24 | 37560.37 | 5434.95 | 4453.79 | 130438.80 | 106890.96 | 901448.88 | South |
| 3 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10011 | ST VINCENT'S EAST | 50 MEDICAL PARK EAST DRIVE | BIRMINGHAM | AL | 35235 | AL - Birmingham | 25 | 13998.28 | 5417.56 | 4129.16 | 135439.00 | 103229.00 | 349957.00 | South |
| 4 | 039 - EXTRACRANIAL PROCEDURES W/O CC/MCC | 10016 | SHELBY BAPTIST MEDICAL CENTER | 1000 FIRST STREET NORTH | ALABASTER | AL | 35007 | AL - Birmingham | 18 | 31633.27 | 5658.33 | 4851.44 | 101849.94 | 87325.92 | 569398.86 | South |
<Axes: xlabel='Provider Region'>
From above charts we can see the data divided among the regions of USA and south is the region with most providers, while west has the least number of providers.
(163065, 16)
- Above we see the shape of the dataframe and see that df has 163K clean rows to be analysed, and the categorical variables in the df shows how many different variation of data exist in df.
- This diversity underscores the complexity of the dataset and necessitates a careful examination of these categorical variables for a comprehensive analysis.
- Hence below we are going to calculate z score for 20 combination of Categorical and Continous variable, these 20 combination will give us the 20 feature that we are seeking from the data.
Summary: These feature represents the standardized score of the total number of discharges, Average Covered Charges, Average Total Payments, Average Medicare Payments by a provider region, DRG Definition, Hospital Referral Region, Provider City, Provider Name respectively.
Insight:
An unusually high or low number of discharges, could indicate irregularities in patient admissions or discharge practices. This could be a sign of fraudulent activities like patient churning or unnecessary admissions.
Extreme values in this features may indicate that a supplier charges much more or less than competitors in the same location. This might suggest service overcharging, service undercharging as part of a bribe fraudulent activity, or service fraud.
Total payments that are unusually high or low may indicate billing problems. High levels may indicate overutilization of services, while low values may indicate a lack or unreported services.
Significant differences in Medicare payments can be indicators of fraudulent activity, such as false claims or services not provided. Both abnormally high and unusually low results require additional investigation.
| Total Discharges_z-value_PR | Average Covered Charges_z-value_PR | Average Total Payments_z-value_PR | Average Medicare Payments_z-value_PR | |
|---|---|---|---|---|
| 0 | 0.908503 | -0.013146 | -0.461808 | -0.459220 |
| 1 | -0.573785 | -0.601873 | -0.460301 | -0.426420 |
| 2 | -0.381280 | 0.138641 | -0.511775 | -0.506951 |
| 3 | -0.362030 | -0.639300 | -0.514314 | -0.556945 |
| 4 | -0.496783 | -0.057052 | -0.479167 | -0.445712 |
Discharges Total_z-value_PR: A high positive z-value shows that a provider has more discharges than the region's standard, which might imply significant patient turnover or over-admissions in comparison to competitors. A negative z-value indicates fewer discharges than predicted, which might be related to lower admission rates or stricter admission requirements.
Covered Charges Average_z-value_PR: Positive z-values show that a provider's charges are greater than the regional average, which could indicate overcharging or higher service costs. Negative numbers may indicate undercharging as a result of reduced operating expenses or alternative pricing methods.
Average Total Payments_z-value_PR: This measure compares the provider's total payments to the regional average. A greater z-value may indicate possible overutilization or overbilling of services, whilst a lower value may imply underbilling or more efficient service supply.
Medicare Average Payments_z-value_PR: Positive z-values indicate that the provider receives more from Medicare than the area average, which might be attributable to increased service utilisation or invoicing for more sophisticated treatments. Negative z-values may imply decreased reliance on Medicare, either as a result of a patient mix with less Medicare coverage or more cautious billing procedures.
Total Discharges_z-value_DRG: A high z-value, like the 2.73 for the first entry, suggests significantly higher patient discharges compared to others in the same DRG category, which might indicate high treatment frequency or potential overuse of services. Negative values, like -0.78, indicate lower than average discharges, which could mean more selective patient treatment or potentially insufficient care.
Average Covered Charges_z-value_DRG: Positive z-scores imply that a provider's charges are higher than the average for that DRG category, which could suggest overpricing or specialization in more complex procedures. Negative scores suggest lower-than-average charges, which could reflect more efficient care, lower cost structures, or potentially undercharging.
Average Total Payments_z-value_DRG: This metric reflects whether the provider's total payments are high or low relative to the DRG average. For example, a z-score of -1.03 indicates payments are lower than average, which may raise questions about the provider's service quality or billing practices.
Average Medicare Payments_z-value_DRG: These z-scores indicate how a provider's Medicare payments compare to the average for the DRG. Negative values, such as -1.15, could suggest a provider is receiving less Medicare reimbursement than peers, possibly due to lower service utilization or conservative billing.
| Total Discharges_z-value_Reff | Average Covered Charges_z-value_Reff | Average Total Payments_z-value_Reff | Average Medicare Payments_z-value_Reff | |
|---|---|---|---|---|
| 0 | 1.359967 | 0.228345 | -0.273428 | -0.276261 |
| 1 | -0.637461 | -0.551520 | -0.357915 | -0.307998 |
| 2 | -0.351035 | 0.044807 | -0.423086 | -0.408636 |
| 3 | -0.322393 | -0.581659 | -0.426301 | -0.471111 |
| 4 | -0.522891 | -0.112782 | -0.381801 | -0.332107 |
Total Discharges_z-value_Reff: A high z-score, like the 1.36 for the first entry, indicates that a hospital has a higher number of discharges compared to the average in its referral region. This could suggest a higher patient volume or a tendency to admit and discharge more frequently than other hospitals in the area.
Average Covered Charges_z-value_Reff: This measures how a hospital's average charges for services compare to the regional average. A positive z-score indicates higher than average charges, which might reflect a hospital's pricing strategy or suggest that it provides more specialized or intensive services.
Average Total Payments_z-value_Reff: A negative z-score here suggests that a hospital is receiving lower total payments than the regional average, possibly indicating efficiency in service delivery or challenges in obtaining reimbursements.
Average Medicare Payments_z-value_Reff: Negative z-scores suggest that a hospital's Medicare payments are less than the regional average. This could raise questions about the hospital's Medicare patient demographics, the mix of services provided to Medicare patients, or its Medicare billing practices.
| Total Discharges_z-value_ProvCity | Average Covered Charges_z-value_ProvCity | Average Total Payments_z-value_ProvCity | Average Medicare Payments_z-value_ProvCity | |
|---|---|---|---|---|
| 0 | 0.832397 | -0.253140 | -0.456736 | -0.444381 |
| 1 | -0.819453 | -0.147131 | -0.376991 | -0.293387 |
| 2 | -0.576670 | -0.141190 | -0.534900 | -0.524413 |
| 3 | -0.443947 | -0.749566 | -0.537415 | -0.540631 |
| 4 | -0.569835 | -0.358372 | -0.468360 | -0.385997 |
Total Discharges_z-value_ProvCity: Positive z-scores, like 0.83, suggest that a provider has more discharges than the average for their city. This might indicate higher patient throughput. Negative z-scores, such as -0.82, show fewer discharges than the city average, potentially pointing to a smaller operational scale or more selective patient admission.
Average Covered Charges_z-value_ProvCity: The z-scores indicate whether a provider's charges are above or below the city average. Most values shown are negative, suggesting these providers generally charge less than the city average for their services.
Average Total Payments_z-value_ProvCity: These z-scores compare the total payments received to the city average. Most providers in the table have negative z-scores, implying they receive less in payments compared to the city average, which could be due to a variety of factors including the types of services provided, patient demographics, or efficiency in service delivery.
Average Medicare Payments_z-value_ProvCity: Similar to total payments, these z-scores compare Medicare payments against the city average. Negative values, such as -0.54, suggest a provider's Medicare reimbursements are below the city average, possibly indicating less reliance on Medicare billing, fewer services provided to Medicare patients, or more conservative billing practices.
| Total Discharges_z-value_ProvName | Average Covered Charges_z-value_ProvName | Average Total Payments_z-value_ProvName | Average Medicare Payments_z-value_ProvName | |
|---|---|---|---|---|
| 0 | 0.600564 | -0.102496 | -0.492153 | -0.486499 |
| 1 | -0.819453 | -0.147131 | -0.376991 | -0.293387 |
| 2 | -0.534283 | 0.023227 | -0.490562 | -0.477653 |
| 3 | -0.273884 | -0.736364 | -0.498389 | -0.532817 |
| 4 | -0.569835 | -0.358372 | -0.468360 | -0.385997 |
Total Discharges_z-value_ProvName: Positive z-scores (like 0.60 for the first entry) suggest a provider has a higher number of discharges compared to the average for that provider name. This might suggest a higher patient volume or different operational practices. Negative z-scores indicate fewer discharges, which could be due to various factors such as smaller facility size or patient demographics.
Average Covered Charges_z-value_ProvName: This metric reflects the provider's pricing relative to the average for that provider name. Most values in the table are negative, which suggests these particular providers charge less for services than the average for their provider name group.
Average Total Payments_z-value_ProvName: Negative z-scores across the board suggest that these providers receive lower total payments than the average for their provider name group. This could indicate more efficient service provision, a different payer mix, or possibly an underutilization of services.
Average Medicare Payments_z-value_ProvName: Similar to total payments, these z-scores reflect how Medicare payments to the provider compare to the average for their provider name group. Negative scores, such as -0.53 for the third entry, could suggest lower reimbursement rates from Medicare, potentially due to a variety of billing practices or service offerings.
Total Discharges 92 Average Covered Charges 92 Average Total Payments 92 Average Medicare Payments 92 dtype: int64
<class 'pandas.core.frame.DataFrame'> Index: 162965 entries, 0 to 163064 Data columns (total 20 columns): # Column Non-Null Count Dtype --- ------ -------------- ----- 0 Total Discharges_z-value_PR 162965 non-null float64 1 Average Covered Charges_z-value_PR 162965 non-null float64 2 Average Total Payments_z-value_PR 162965 non-null float64 3 Average Medicare Payments_z-value_PR 162965 non-null float64 4 Total Discharges_z-value_DRG 162965 non-null float64 5 Average Covered Charges_z-value_DRG 162965 non-null float64 6 Average Total Payments_z-value_DRG 162965 non-null float64 7 Average Medicare Payments_z-value_DRG 162965 non-null float64 8 Total Discharges_z-value_Reff 162965 non-null float64 9 Average Covered Charges_z-value_Reff 162965 non-null float64 10 Average Total Payments_z-value_Reff 162965 non-null float64 11 Average Medicare Payments_z-value_Reff 162965 non-null float64 12 Total Discharges_z-value_ProvCity 162965 non-null float64 13 Average Covered Charges_z-value_ProvCity 162965 non-null float64 14 Average Total Payments_z-value_ProvCity 162965 non-null float64 15 Average Medicare Payments_z-value_ProvCity 162965 non-null float64 16 Total Discharges_z-value_ProvName 162965 non-null float64 17 Average Covered Charges_z-value_ProvName 162965 non-null float64 18 Average Total Payments_z-value_ProvName 162965 non-null float64 19 Average Medicare Payments_z-value_ProvName 162965 non-null float64 dtypes: float64(20) memory usage: 26.1 MB
| Total Discharges_z-value_PR | Average Covered Charges_z-value_PR | Average Total Payments_z-value_PR | Average Medicare Payments_z-value_PR | Total Discharges_z-value_DRG | Average Covered Charges_z-value_DRG | Average Total Payments_z-value_DRG | Average Medicare Payments_z-value_DRG | Total Discharges_z-value_Reff | Average Covered Charges_z-value_Reff | ... | Total Discharges_z-value_ProvCity | Average Covered Charges_z-value_ProvCity | Average Total Payments_z-value_ProvCity | Average Medicare Payments_z-value_ProvCity | Total Discharges_z-value_ProvName | Average Covered Charges_z-value_ProvName | Average Total Payments_z-value_ProvName | Average Medicare Payments_z-value_ProvName | Outlier | Outlier_knn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.91 | -0.01 | -0.46 | -0.46 | 2.73 | 0.16 | -0.80 | -0.64 | 1.36 | 0.23 | ... | 0.83 | -0.25 | -0.46 | -0.44 | 0.60 | -0.10 | -0.49 | -0.49 | 0 | 0 |
| 1 | -0.57 | -0.60 | -0.46 | -0.43 | -0.78 | -0.96 | -0.79 | -0.47 | -0.64 | -0.55 | ... | -0.82 | -0.15 | -0.38 | -0.29 | -0.82 | -0.15 | -0.38 | -0.29 | 0 | 0 |
| 2 | -0.38 | 0.14 | -0.51 | -0.51 | -0.33 | 0.44 | -1.03 | -0.89 | -0.35 | 0.04 | ... | -0.58 | -0.14 | -0.53 | -0.52 | -0.53 | 0.02 | -0.49 | -0.48 | 0 | 0 |
| 3 | -0.36 | -0.64 | -0.51 | -0.56 | -0.28 | -1.03 | -1.04 | -1.15 | -0.32 | -0.58 | ... | -0.44 | -0.75 | -0.54 | -0.54 | -0.27 | -0.74 | -0.50 | -0.53 | 0 | 0 |
| 4 | -0.50 | -0.06 | -0.48 | -0.45 | -0.60 | 0.07 | -0.88 | -0.57 | -0.52 | -0.11 | ... | -0.57 | -0.36 | -0.47 | -0.39 | -0.57 | -0.36 | -0.47 | -0.39 | 0 | 0 |
5 rows × 22 columns
<seaborn.axisgrid.PairGrid at 0x232657854d0>